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Developing Processing Techniques for Rbylab Data 
Monthly I’rogieoo Report, June 1975 


The following report servos os the twenty- eighth monthly progress 
report for LillU’ Invest it* 1 cion AS 6 M which is entitled "Developing 
Processing Techniques fer Skylab Dot. The financial report for this 
contract (NAS9-1 3280) is bring submitted under separate cover. 

The purpose of this investigation is to test information extraction 
techniques for Rift LA] > R-192 data end compare with results obtained la 
applying these techniques to LANDSAT and aircraft scanner data. 

During the reporting period progress was made in the continuirg 
analysts of the effects oJ spatial ini sregistration ol SDOs on boundary 
pixels and in tin implementation of the mixtures classifier. Additionally 
processing was begun on the signature extension task. 

JKtf o < t_';_ of j i,r>. :ii. i - to. -f U an >oJ S par i nj_ Ml sregist r ati o n on C ir orl I ierf te n 

Af. e»r ..cy and mi r«-i l n.ir t:on Hs t i mati on 

Previous reports have described simulation techniques dcve.loped to 
investigate effects ol channcl-to-channel misregistration and experiment 
designs to aid in the analysis of cny effects. A discusrion has been 
presented on the effects of spatial misregistration on field-center 
clasc 1 fi curie! accuracy. During this report period further data pro- 
cessing stages o: the experiment were carried out, and an nnalyris was 
mode in an effort to study the. effects of clienr.ei -to-channei misr< gistrntion 
on border (mixture) pixels. A discussion of this analysis follows. 

The study conducted centered on the analy Is of four basic problem: 

(3) rhe effect of misregistration on the class.! H cation of a mixture pixel 
of two ground covers; (2) the effect of misregistration on the false alarm 
rate of any given crop among mixtures of two other ground covers; (3) the 
effect of misregistration on proportion estimation; and (A) the effect of 
misregistration on the proportion of Held ceati r pixels in the data. In 
tills report wc. shall deal with the first three analyses. The fourth prob.’ .ui 
will be discussed next m*nth. 

In ord.-. io study those four problems, the experiment described in 
the April monthly v. s cat* i *d out. two types of signature simulations 
were required. First, signatures representing f 4 eid center distributions 
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misregistered for factors of 1/3, 2/3 and 1 whole pixel in SDOs 2, 12 
and 17 were calculated. New distributions representing mixtures of ull 
permutations of two ground covers for varying proportions were (simulated 
ar follows. Let o jA and be tlse proportions of distributions A and B 

in the it* 1 ctvinnol used to simulate a mixture of ground covers A and B. 

For perfectly registered signatures, was set to 2/3, 1/3 and 0 for 

every channel i. However for misregistered signatures, the channels out 
of registration would be in different proportions. For example, if a 
signature was misregisteied by 1/2 a pixel the proportion of cover type A 
would be - 1/2. Hence any field-center pixels in the registered ense 

within 1/2 pixel of the boundary would become mixture pixels in the rnis- 
registcred case. (In effect there would be fewer field center pixels.) 
Therefore signatures representing mixtures of misreglstercd distributions 
were simulated with proportions of and in the registered channels i 

and - B, + 3 in the misregistered channels J where B is the degree 

oi misregistration. 

Once the- simulated signatures were attained the program FIX was run 
to calculate fho expected performance for each pet of gnatutes repre- 
senting a given misregistration. That is, given the best linear decision 
boundaries between the 5 field-center signatures, what will the expected 
classification of mixture pixels appear to be. It should be mentioned 
at this point that processing w..s also carried ouL with orly one channel 
misregistered, hut analysis has not been completed and only’ the analysis 
of the three-channel case will he presented. 

In order to simplify the discussion of the results, the presentation 
here will center only on the effects noted for brush and grass mixture 
pixels interacting with brush, grass and corn signatures, where the 
channels were nr.t uirc-d mi c re ni 8 to red from the brush into the grass ground 
cover. Tills example was chosen for presentation here because corn, grass 
and trees comprise almost three-fourths of the scene. It represents 
neither one extreme nor the other — it is in fact fairly typical of the 
whole ; tuoy . What will be particularly noted is (3) the effects on the 
classification of brush-grass mixtures is either brush or grass, (2) the 
corn false alarm rute, and ( 3 ) the effect on proportion estimation. 

Three sets of curves will be presented. These curves display the expected 
performance of brush und brush-grass pixels as a function of the proportion 
of each crop type. In a sen-.e one could envision, as an aid in studying 
these graphs, a resolution element moving across a fixed field boundary 
and at various locations the. expected pro (ability of that resolution element's 
classification would he calculated. Note in each of the following graphs 
a zone representing pete field center pixels in the registered case has been 


s* 


f OMttl Ml l tKUl A Mi jN iAfekil Imi iiNtvl M .III i<» MM »m«.an 


101900-59-L 
Tagc 4 

labelled as well os an area representing mixtures of varying degrees. The 
width of these zo les is exactly one pixel and the field boundary would 
appear at the po'.nt (1/2, 1/2) op. the ordinate. 

Figure 1 displays three graphs, one for each degree of misregistration 
considered, plotting the expected probability of classifying brush and 
brush-gt ass mixtures as brush (the solid line) or grass (the dashed line). 

In Figure 1-A, on top, one notes that in the area designated brush, these 
field center pixels ure for the most part classified as brush. As tbo 
mixture of brush and grass becomes predominantly grass, the performance 
curve increases for grass and decreases for brush. Comparing Figure 1-A 
with Figures 1-B and 1-C (i.e., as misregistration increases), one notices 
immediately that the field center brush pixels are not recognized as brush 
with as much consistency. The expected performance of those most near the 
border deteriorates from around &0% to about 15%. Ibis indicates that 
misregistration does effect the correct classification of field center 
pixels to a significant degree. 

Figure 2 displays the expected probability of classifying a brush 
or brush-grass pixel n 3 corn. Even in the registered case the corn false 
alarms among brush-grass pixels are significant. As misregistration is 
introduced, more nnci more corn false alarms occur among pixels that were 
pure field center brush pixels in the registered case. In fact those most 
near the border are called corn with up to 40% regularity. In view of 
this graph alone one cannot dismiss the significant increase in corn false 
alarms introduced by misregistration of the data. 

A question of obvious concern is to what extent proportion estimation 
is affected by these effects of channel-to-channel misregistration. It is 
argued generally that errors tend to compensate for other errors: that is, 
errors are made uniformly in all directions and over a large sample their 
affects will bo cancelled. The surprising com false alarm rote among 
registered pixels of brush-grass already nukes the process of proportion 
estimation a less than exact science. The increased number of false alarms 
to he expected with the introduction of misregistration place* even more 
reliance on compensating errors. 

Figure 3 is presented to show that the errors introduced are not 
strictly compensatory for proportion estimation, especially when misregis- 
tration is Introduced in the scene. let us focus our attention on the 
estimation of the proportion of corn. Noting an increased rate of corn 
false alarms among brush-grass pixels, these would necessarily have to be 
compensated for by a decrease in the correct classification of corn or 
mixtures ol corn-other pixels (here we use the expression coirect classi- 
fication 'o flic scr.se that mixtures of two covers A and B are classified 
as eitlur A or C) . Figure 3 is a graph of Lite expected probability of "correct" 


FIGURE 1. FXTECTED CLASSIFICATION PERFORMANCE OF BRUSH, 
BRUSH-CRASS MIXTURE PIXELS 
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classification of two ground covers as labelled as a function of the 
mixture proportion. The solid line Indicates the amount of brush-grass 
correctly classified. With more misregistration there are more false 
alarms particularly of corn* as previously noted. However the correct 
classification of corn, corn-grass or corn-brush pixels does not corre- 
spondingly decrease Indicating that corn Tiny be overestimated In tho 
given scene. 

One can conclude from the evidence presented that the effects of 
misregistration are indeed significant in a detrimental manner in 
(1) the correct classification of border and near-border pixels; (2) in 
increasing the rate of false alarms, and (3) in proportion estimation. 


Subrv solution Elem en t Proccs sing. 

Last month wc reported on initial geometrical analysis of the set 
of spectral signatures extracted from the data for use with the mixtures 
classifier. During the current period we continued by testing the 
chosen subset or signatures on a small portion of the data. Time was 
spent adapting the necessary software for use with S-192 data. 

Before proceeding further, a short explanation of the manner in 
which the mixtures piocer»«.or Is applied is in order. 

It is obvious that a pixel may bo purely or almost purely of one 
ground class, or it may be a mixture of several ground classes. Thus 
the algorithm used, as its first stage, determines the several likeliest 
pof.rtlbillt us. First, the most probuble tingle signature for a pixel, 
and the attendant chi-square value arc determined. 7hon the program 
determines the most probable pair of signatuics, the proportion of caU* 
cover in the pixel, end an associated chi-square value. The pixel may 
be further analyzed as a mixture of three and four covers. For reasons 
of processing time and computer space requirements this study will limit 
the hypothesis to pure or two class mixture pixels. Thin is not an 
unreal i: t ic restriction when one conside rs the scan swath over the ground. 
For an agricultural area like the current data set, moat mixture pixels 
will occur at field boundaries such that the vast majority of mixture 
pixel: will be mixtures of two ground classes. (In general, it is expected 
that 25-40% of tho pixels In an agricultural data sot will be mixture 
pixels . ) 

The data are then processed through a second stage where a pixel is 
determined to be a pure pix«*l if the chi-square value for the vanning 
pure case is lees than 9 ome threshold t., . If it is not pure according 
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to this test, then the cht square value for the two clasa mixture case 
1 s compared to a second threshold If it is less than t 2 » the pixel 

is determined to bo the mixture indicated; otherwise the pixel i6 con- 
sidered to be from a class or classes not included in the signature set* 

Currently the thresholds i^, nre c ^ ,oscn empirically — hence the 
need for two processing stapes. The parameters tj and are chosen so 

as to minimi z*. the error of the proportion estimate over some training 
area of known proportion. 

The signature set described last month was applied to a small 550 
pixel section of the data. Subsequent analysis showed that very little 
of the data were being called out as grass, and au a result the error 
rate was substanti 1. It seemed that the Initial choice of a glass 
sign; lure was a poor one (sec last month's discussion). Accordingly, a 
different grasp signature was selected, this one being the grabs cluster 

. 

threug*- r, v: mixtures classifier. The results were somewhat better, but 
tfi“ total ^rror in lh< proportion eetiraation for the tent date was still 
slightly inferior to the error rate achieved using the normal, linear 
maximum likelihood, classifier. It was furthei noted that the chi square 
thresholds c'.iocen, which minimized he total error of the proportion 
, ted • being couni “pure" and 

18% of the pixels being assessed as mixtures. Many more mixture pixels 
had been anticipated. One hypothesis for these results is that thi 
distribution.'! represented by the signatures are both close together and 
very broad. Thus, perhaps many of the mixture pixels are very near the 
center of some other distribution. Also, the regular maximum likelihood 
classification had been done using 15 signatures — for the mixtures 
approach we are using only six. It seems that it may be necessary to 
further pack the signature simplex with othei g’ass signatures so as to 
increase the grass classification rate. 

During the coming month we intend to continue in the analysis of 
these results and co icr.ce further processing. 


Signat ure K : t e nnlori 


One data processing technique to reduce the cost of processing data 
acquired over cither large areas or many similar sites, and to decrease 
the time required to perform the processing, is by utilizing one set of 
training data to process all the ditu collected. Since multispccl ml 
scanner data axe affected by many natural phenomena which serve to greatly 
increase the variability In the datal-i, the ability to extend the 
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applicability of spectral signatures over *» number of data sets or over 
large variations in the radiation sensed requires the use of special 
algorithms to adapt the training information to the data or vice versa, 

For this contract we intend to develop the capability to extend 
the applicability of training information for SK.YLAB S-192 data. 

For this task we have defined a training area and a test area. The 
training will be carried out over the area in and around Lansing, Michigan 
Primary ground classes are dense urban, residential, trees, general vege- 
tation, bare soil, and water. The test area will include Ypsilanti, 
Michigan and the area immediately to the east. This area has the same 
types of ground classes as the training area. The sites, which are both 
included in the S-192 data set we have been working with, are located 
approximately 80 miles apart. Additionally, the atmospheric state for 
the two sites is quite different. Examination of S—190B imagery acquired 
at the same time as the scanner data shows clearly that the training site 
is cloud free, and the atmosphere appears very clear, while the test site 
is covered by a considerable haze layer and is surrounded by clouds. 

Processing for the signature extension tack will include clustering 
for signatures and subsequent classification for both the training area 
and the test area. Then the signatures from the training area will be. 
used without any transformation to classify the test site, for use as a 
bench mark during subsequent analysis. Then the signatures will be 
modified by various signature extension transformations and used to 
process the test site data. 

The transformations tested will include the MASC algorithm^, and 
adaptive classification!^] , both developed at BRIM. Further signature 
extension techniques will be devised and tested as time allows . 

Initial processing was begun by clustering an area in the test site. 
There had been some apprehension that the haze layer would sufficiently 
affect the reflected radiation that classification could not be adequately 
carried out. The results of the clustering were good and showed that the 
area can be well classified. 

During the coming month we will continue processing in accordance 
with the plan outlined above . 
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